Investigating Outliers Detection Methods for the Iranian Manufacturing Establishment Survey Data
نویسندگان
چکیده
The role and importance of the industrial sector in the economic development specify the necessity of having accurate and timely data for exact planning. As outliers data in establishment surveys are common due to the structure of the economy, the evaluation of survey data by identifying and investigating outliers prior to the release of data is necessary. In this paper the practical application of different robust multivariate outlier detection methods based on the Mahalanobis distance with BACON algorithm, minimum volume ellipsoid (MVE) estimator, minimum covariance determinant (MCD) estimator, Stahel-Donoho estimator is presented. Also some univariate outlier detection methods such as Hadi and Simonoff (1993) method, using some regression models, are presented. These methods are illustrated using a real data set on Iranian Manufacturing Establishment Survey (IMES). These data are collected each year by the Statistical Center of Iran using sampling weights. In this paper it is demonstrated that the use of different robust outlier detection methods (multivariate and univariate), in a number of manufacturing industries, leads to the same results.
منابع مشابه
Outliers and influential observations can have an important effect on work with estimation and inference from establishment survey data. Practical development and implementation of methods to identify and account for outliers and influential observations in complex survey data
Outliers and influential observations can have an important effect on work with estimation and inference from establishment survey data. Practical development and implementation of methods to identify and account for outliers and influential observations in complex survey data require an agency to balance several factors, including: (i) the mathematical statistics properties of detection method...
متن کاملIdentification of outliers types in multivariate time series using genetic algorithm
Multivariate time series data, often, modeled using vector autoregressive moving average (VARMA) model. But presence of outliers can violates the stationary assumption and may lead to wrong modeling, biased estimation of parameters and inaccurate prediction. Thus, detection of these points and how to deal properly with them, especially in relation to modeling and parameter estimation of VARMA m...
متن کاملIntroduction Package CircOutlier For Detection of Outliers in Circular-Circular Regression
One of the most important problem in any statistical analysis is the existence of unexpected observations. Some observations are not a part of the study and are known as outliers. Studies have shown that the outliers affect to the performance of statistical standard methods in models and predictions. The point of this work is to provide a couple of statistical package in R software to identi...
متن کاملA statistical test for outlier identification in data envelopment analysis
In the use of peer group data to assess individual, typical or best practice performance, the effective detection of outliers is critical for achieving useful results. In these ‘‘deterministic’’ frontier models, statistical theory is now mostly available. This paper deals with the statistical pared sample method and its capability of detecting outliers in data envelopment analysis. In the prese...
متن کاملA robust wavelet based profile monitoring and change point detection using S-estimator and clustering
Some quality characteristics are well defined when treated as response variables and are related to some independent variables. This relationship is called a profile. Parametric models, such as linear models, may be used to model profiles. However, in practical applications due to the complexity of many processes it is not usually possible to model a process using parametric models.In these cas...
متن کامل